Entropic test of quantum contextuality 
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We study the contextuality of a three-level quantum system using classical conditional entropy of 
measurement outcomes. First, we analytically construct the minimal configuration of measurements 
required to reveal contextuality. Next, an entropic contextual inequality is formulated, analogous to 
the entropic Bell inequalities derived by Braunstein and Caves in [Phys. Rev. Lett. 61, 662 (1988)], 
that must be satisfied by all non-contextual theories. We find optimal measurements for violation of 
this inequality. The approach is easily extendable to higher dimensional quantum systems and more 
measurements. Our theoretical findings can be verified in the laboratory with current technology. 



Introduction In information theory the relation be- 
tween two events can be quantified using the notion of 
conditional entropy. This notion was successfully ap- 
plied to study the relation between space-like separated 
measurements on different parts of an entangled quan- 
tum system and it was shown that conditional entropies 
for measurements on such systems do not obey classical 
properties of entropy pQ, which is yet another manifes- 
tation of quantum nonlocality [2]- Since quantum non- 
locality is only a special case of quantum contextuality 
[3] it is natural to ask whether classical properties of en- 
tropy also fail do describe measurements in contextual 
scenarios. 

The notion of contextuality, as introduced by Kochen 
and Specker (KS) [3], can be explained as follows. Sup- 
pose that a measurement A can be jointly performed with 
either B or C . Measurements B and C are said to pro- 
vide a context for the measurement A. The measurement 
A is contextual if its outcome does depend on whether it 
was performed together with B or with C . Therefore, the 
essence of contextuality is the lack of possibility to assign 
an outcome to A prior to its measurement and indepen- 
dently of the context in which it was performed. The 
crucial observation by KS [3] was that quantum theory 
is contextual for any system whose dimension is greater 
than two. The seemingly different Bell theorem [3] is in 
fact a special instance of the KS theorem where contexts 
naturally arise from the spatial separation of measure- 
ments. 

In mathematical terms quantum nonlocality and con- 
textuality can be formulated in terms of probability 
theory. Specifically, the reason behind both Bell and 
KS theorems is the lack of joint probability distribu- 
tion for all measured observables [5HZ]. For example, 
imagine some physical system on which one can per- 
form various measurements denoted by A\, A%, . . . , An. 
Each measurement Ai yields an outcome on j t (where ji 
enumerates outcomes) with probability p(Ai — aij { ). 
The non-contextuality hypothesis is true if and only 
if there exists a joint probability distribution for the 
outcomes of all observations, i.e., p(Ai = aw^-Ag = 



Q2,j 2 , ■ ■ ■ , An = a>N,j N )> such that one can recover all the 
measurable probabilities as its marginals. For instance, 
consider that the subset of measurements { A kl , . . . , A kl } 
with {fci, . . . , ki} C {1, . . . , N} can be jointly performed, 
in other words, the probability distribution p(A kl — 
ak lt j hl ,...,Ak, = aki,j H ) can be experimentally deter- 
mined. The non-contextuality hypothesis then requires 
that the joint probability distribution for all measure- 
ments, p(Ai = aij 1 ,A 2 = a 2 ,j 2 ,- ■ ■ ,A N = a N j N ) recov- 
ers any such p(A kl = a kl Jki , . . . , A kl = a kl ,j kl ) as its 
marginal, i.e., 

p(A kl = a kl Jki , . . . , A kl = a kl Jki ) = 

^2 = a iJi' ■ ■■>A N = a N ,j N )- 

Here the summation is over the outcomes of all the mea- 
surements Aj that are not in the jointly measurable sub- 
set. From hereon, we denote probabilities by p(Aj) in- 
stead of p(Ai — ai) for notational convenience, wherever 
there is no possibility of confusion. 

An important question is the minimal number of 
measurements on some quantum system that one has 
to perform in order to observe contextuality and pre- 
vent the existence of their joint probability distribution. 
The currently known most economic proofs for a three- 
dimensional system (qutrit) consist of 5 measurements 
in case of a state-dependent test [8] and of 13 measure- 
ments in case of a state-independent test [9j. Qutrits 
are of special interest since they are not only the small- 
est contextual systems, but they also physically corre- 
spond to a single system to which the concept of non- 
locality and entanglement cannot be unambiguously ap- 
plied. Therefore, a single qutrit together with the most 
economic set of contextual measurements can be consid- 
ered as a primitive of quantum contextuality in a sim- 
ilar sense as an entangled pair of qubits together with 
the CHSH (Clauser-Horn-Shimony-Holt) inequality [TU] 
is considered as a primitive of quantum nonlocality. 

In this Letter we analytically find the minimal num- 
ber of contextual measurements for a single qutrit. A 
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FIG. 1: Graphical notation of commutation relations. Ver- 
tices of the graph correspond to observables and edges repre- 
sent commutativity. 



new graph theoretical method to construct joint proba- 
bility distributions is presented and it is proven that for 
sets of measurements whose corresponding graphs do not 
contain cycles (i.e., there are no subsets of measurements 
that cyclic-commute) there always exists a joint probabil- 
ity distribution reproducing quantum marginals. Next, 
using the minimal set of measurements we formulate an 
cntropic contextual inequality in the spirit of [T] . Finally, 
we find measurement settings for an optimal violation of 
this inequality for a single qutrit. 

Minimal number of contextual measurements In or- 
der to find the minimal number of contextual measure- 
ments one is tempted to start with two measurements 
A and B. However, this does not work because (i) 
either A and B commute in which case quantum me- 
chanics itself provides a joint probability distribution, 
or (ii) A and B do not commute and one can simply 
write p(A,B) = p(A)p(B), a joint probability that re- 
produces the marginal probabilities p(A) and p(B) . Ob- 
serve that a single particle in one dimension for which 
measurements of position and momentum do not com- 
mute is not contextual because of (ii). Next, consider 
three measurements: A, B and C. The various scenarios 
are as follows: (i) All three measurements mutually com- 
mute, which is equivalent to (i) for two measurements (ii) 
All of them do not commute, which allows us to define 
p(A, B, C) — p(A)p(B)p(C) (iii) Only one pair commutes 
(A and B) in which case the joint probability distribution 
is p(A,B,C) = p(A,B)p(C), where p(A,B) is provided 
by quantum mechanics, (iv) One pair of them {B and 
C) does not commute in which case one may construct 
p(A, B, C) = p(A, B)p{A, C)/p(A). This joint probability 
distribution reproduces all measurable marginals, there- 
fore the system that has only two contexts is not suffi- 
cient to refute non-contextuality. The next case of four 
measurements was shown to be sufficient to prove this 
discrepancy for a system of dimension at least four, and 
is known as the CHSH inequality [TP] . 

Can we show the discrepancy for a three-level sys- 
tem and only four measurements? To show that the 
answer is no, it is convenient to introduce graphic no- 
tation as in Fig. [T] Each vertex represents a mea- 
surement and edges represent commutation between the 
connected measurements. The only significant scenar- 



ios that do not reduce to previous considerations are 
represented by the chain graph (in Fig. [I] (i)), the 
star graph (ii) and the cycle (iii). For (i) we construct 
p(A,B,C,D) = p(A,B)p(B,C)p(C,D)/(p(B)p(C)), for 
(ii) p(A,B,C,D) = p(A,B)p(A,C)p(A,D)/(p(A)p(A)). 
Note that the probabilities on the right-hand side of these 
equations exist due to the assumption of joint measura- 
bility. Measurements corresponding to the graph in Fig. 
[T] (iii) do not exist for a three-level system. This is be- 
cause in order to have [A, B] = and [A, D] = 0, but 
[B, D] 7^ one requires A to be a degenerate operator. 
In the case of a three-level system, this means that two 
eigenvalues of A are the same and therefore, without loss 
of generality A can be set to be a projector of rank one. 
Therefore all four measurements A, B, C and D are rank 
one projectors. The cycle graph (iii) implies that both A 
and C are orthogonal to B and D. Since we require that 
B 7^ D, these two projectors span a plane orthogonal to 
both A and C which in three-dimensional Hilbert space 
implies A = C . The problem then reduces to case (iii) 
for three measurements. 

For three-level systems one requires at least five pro- 
jective measurements to show the lack of joint probability 
distribution. Before we proceed, let us prove one prop- 
erty of the construction used above, namely that for any 
commutation graph which does not contain cycles (tree 
graph) there always exits a joint probability distribution 
consistent with quantum theory. This construction is 
given by the product of probability distributions corre- 
sponding to the edges of the graph (denoted by the set 
E(G)) divided by the product of probabilities of common 
vertices, where a vertex i £ V(G) (the set of vertices of 
the graph) of degree d(i) (the number of nearest neigh- 
bors) appears d(i) — 1 times in the product, i.e., 



p(A u ...,A N ) = 



n 



p(Ai,Aj 



n ie y (G) p(^) rf(i) - 



Since quantum theory provides joint probability distribu- 
tions for any two commuting observables, this construc- 
tion recovers any measurable marginal as can be seen by 
summing over all other observables, starting the summa- 
tion from the leaves (free ends of the tree). For example, 
for the instance presented in Fig. [2] (i) the joint proba- 
bility distribution is 



p{A u ...,A 7 ) 



p(A 1 ,A 2 )p(A 1 ,A 3 )p(A 2 , A 4 )p(A 2 , A 5 ) 

p{A 1 )p{A 2 f 
p(A 3 ,A 6 )p(A 3 ,A 7 ) 



and for instance to recover ^1(^2,^5) the summation or- 
der is Ar, Ae, A4, As, A\. In fact, the construction can 
also be applied to the scenario when the commutation 
graph apart from the tree structure also contains cliques 
(a clique is a fully conntected subgraph which by defini- 
tion is a jointly measurable subset). 
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FIG. 2: Two examples of graphs of observables admitting 
joint probability distributions. 

Since all open graphs are trees for which joint proba- 
bility distribution exists and for a three-level system one 
requires at least five projective measurements, the min- 
imal graph for which one can show the discrepancy is a 
pentagon (5-cycle). For other graphs with cycles smaller 
than five, such as the example in Fig. 2 (ii), one can 
always find joint probability distributions, for example 

(A , s P (A 1 ,A 2 ,A 3 )p(A 3 ,A 4 )p(A 4 , A 5 ) 

p(A u ...,A 5 )= . 

p(A 3 )p(A 4 ) 

The similar case with a square (4-cycle) does not work 
due to reasons already discussed. This analytic result 
confirms observation in [5] that projectors corresponding 
to the 5-cycle are necessary and sufficient to reveal the 
contextuality of a single three-level system. 

Entropic contextual inequality We now focus on a three- 
level system which can physically correspond to spin 1. 
Spin states are represented by rank 1 projectors A s n , 
where s = 0, ±1 and n denotes the direction of spin pro- 
jection. We say that A s n is measured if it is an eigen- 
projector of the measured observable. It is well known 
that the squares of spin 1 operators S% and for two 
orthogonal directions n and m can be jointly measured. 
An example of the observable corresponding to this mea- 
surement is S% — S^- We say that this observable pro- 
vides the context {A^ , A^}. However, the same operator 
can be jointly measured with any other operator S^, , 
where m! is confined to the plane orthogonal to n, hence 
there are many contexts for the measurement of ^4° . 

Let us derive an entropic contextual inequality analo- 
gous to the entropic Bell inequality in pQ . It involves five 
projectors {A\, A 2 , . . . , A 5 } (Ai — \Ai)(Ai\) on which we 
impose cyclic orthogonality restrictions, i.e. AiA i+4 = 0, 
where the subscript is modulo five. Neighboring pro- 
jectors are jointly measurable since they are orthogo- 
nal. As a result, for every projector Ai there exist two 
contexts {A^Aj-t!}. In case of spin 1 these projec- 
tors may correspond to A®, where i and i ± 1 denote 
orthogonal directions in real space. Let us start with 
the assumption that despite the fact that not all projec- 
tors are jointly measurable, there exists a joint proba- 
bility distribution for all five projectors p[A\, . . . , A 5 ). 
This joint probability distribution is a non-contextual 
description of the measurements {^4j}- ^ is then pos- 
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FIG. 3: Configuration of projectors leading to maximal vi- 
olation of the entropic contextual inequality for three-level 
systems. 

sible to define the joint entropy H(A 4 , A 2 , A 3 , A4, A5), 
where H(A) = —J2 a P(A — a )\°gp(A — a) denotes 
the Shannon entropy. Two classical properties of the 
Shannon entropy are used in the derivation of the en- 
tropic contextual inequality as in pQ. The first is the 
chain rule H(A, B) = H(A\B) + H(B) and the second is 
H(A\B) < H(A) < H(A, B). The latter inequality has 
the intuitive interpretation that two random variables 
cannot contain less information than one of them and 
that conditioning cannot increase information content of 
A. The conditional entropy above is defined as 

H(A\B)= ]T p{B = b)H(A\B = b), 

b=0,l 

where the entropy H(A\B = b) is defined like standard 
entropy, but using conditional probabilities p(A — a\B — 
b). The conditional entropy H(A\B) describes the infor- 
mation content of A given the value of B. 
Repeated application of the chain rule yields 

H(A 1 ,A 2 ,A 3 ,A 4 ,A 5 ) = H(A 1 \A 2 ,A 3 ,A 4 ,A 5 ) + 
H(A 2 \A 3 , A 4 , A 5 ) + H(A 3 \A 4 , A 5 ) + H(A 4 \A 5 ) + H{A 5 ). 

Using the inequality H(A\B) < H(A) < H(A,B), one 
then obtains the entropic contextual inequality 

H(Ai\A 5 ) < H(Ai\A 2 )+H(A 2 \A 3 )+H(A 3 \A i )+H(A 4: \A 5 ). 

(1) 

We define the quantity 

C = H(A 1 \A5)-H(A 1 \A2)~H(A 2 \A 3 )-H(A 3 \A 4 )-H(A 4 \A 5 ), 

therefore the inequality ([T]) can be rewritten as C < 0. 
For the three-level system the maximal violation of this 
inequality can be shown to be of magnitude 0.091 bits. 
The optimal solution can be written as follows with pa- 
rameters 6 = 0.2366 and if — 0.1698 (see Fig. [4] where 
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FIG. 4: The plot of positive part of C as a function of 9 and 
tp. Maximal violation of the entropic contextual inequality is 
seen to be 0.091 bits. 

we plot C as a function of and tp) 

I i\ I ■ n n n\T I a \ ( V COS 2 ¥> tan (p 1 \ 

\A 2 ) = (0,cos^,-sin^) T , \A 3 ) = (1,0, 0) T , 
\A 4 ) = (0,cos^,sin^) T , |A 5 ) = ^ - L_jj 

where x denotes the three-dimensional cross product. 
The projectors corresponding to \Ai) and \A 5 ), in addi- 
tion to orthogonality obey the symmetries (i) (A^tp) = 
(A^), (ii) (A 5 \A 2 ) = {A l \A i ) and (hi) (A 5 \A 3 ) = 
A3). These symmetries uniquely define \Ai) and 
|A 5 ). 

The intuitive reason for the appearance of these sym- 
metries in the optimal solution is the following. Max- 
imal violation of the entropic contextual inequality re- 
quires maximizing H(Ai\A 5 ) while simultaneously min- 
imizing the right-hand side of the inequality. For or- 
thogonal projectors A and B one has H(A\B) — p(B = 
0)H(A\B = 0). If A, B and the state |V>) are copla- 
nar, H(A\B) = because then p(A = 0) = p{B = 1) 
and p(A = 1) = p(B = 0). Therefore, we need to set 
all pairs of projectors corresponding to entropies on the 
right-hand side of the inequality as coplanar with \ip) as 
possible, whilst maximizing H(A\\A§). The symmetries 
listed above arise as a consequence of these considera- 
tions. Furthermore, numerical optimization over the five 
projectors and the state also reveals these symmetries for 
the solution. Note that any pure state of a three-level sys- 
tem violates inequality 0, the optimal projectors being 
obtained from the above solution by Euler rotations. 

Discussion It is important to notice that all entropies 
in the inequality ([I]) can be evaluated within quantum 
theory since they refer to jointly measurable quantities. 
Although the entropic inequality constructed here in- 
volves five projectors as in [8], it is not equivalent to the 
pentagram inequality constructed there. For the penta- 
gram inequality, violation is obtained if and only if the 
joint probability distribution does not exist. Violation of 
the entropic contextual inequality (nl) therefore implies 



violation of the pentagram inequality but the converse is 
not true. The optimal projectors for violation of ([T]) do 
not possess the symmetry of the projectors in the penta- 
gram. For the optimal projectors and state given above, 
the violation of the pentagram inequality is 0.049 which 
is less than the maximal value of \/5 — 2. The reason for 
the asymmetry of optimal projectors in the inequality ([I]) 
and in the Fig. [3] is that the projectors A\ and A 5 are 
special in the sense that H(A 1 \A 5 ) has to be maximized, 
whereas H (Ai\A i+ i) for i = 1, . . . , 4 has to be minimized. 
On the other hand, the pentagram inequality projectors 
are treated on equal footing. 

The pentagram inequality has been recently tested in 
the laboratory in [llj . a similar setup can be used to 
test the entropic inequality as well. Entropic contextual 
inequalities can be easily constructed for more projec- 
tors than five and applied to higher dimensional quantum 
; systems following the construction above. Since these in- 
equalities are not equivalent to those following from the 
approach in [8], an interesting problem is to investigate 
the set of quantum states that violates entropic inequal- 
ities as opposed to the set that violates the inequalities 
in [3]. The earlier approach is based on studying the 
extremal edges of a polyhedral cone, which leads to a 
finite set of inequalities that are hard to construct and 
interpret. Entropic contextual inequalities are simpler to 
construct and carry a clear information-theoretic inter- 
pretation. The violation of the entropic contextual in- 
equality indicates that the joint probability distribution 
does not exist. Insistence on a joint probability distribu- 
tion would result in negative information whose deficit 
is measured by the violation of the inequality. Moreover, 
for a single three-level system no entanglement exists and 
therefore violation of the entropic inequality is solely due 
to contextuality, unlike the entropic Bell inequality in [1 
where entanglement was necessary. It is interesting how 
these inequalities extend to macroscopic systems where 
entropies arise naturally in the context of thermodynam- 
ics. 

Conclusions In this paper we have constructed an en- 
tropic contextual inequality that can be applied to the 
simplest indivisible quantum system, namely a single 
three-level system. After analytically showing that the 
minimal commutation graph for this system is the 5- 
cycle confirming earlier observations, we constructed the 
optimal set of projectors that maximally violate this in- 
equality. Note that a different information-theoretic ap- 
proach to contextuality has been considered in [12] using 
the notion of min-entropy. Also, the constraints on the 
Shannon entropies of marginal probabilities from the ex- 
istence of a joint probability distribution for graphs that 
are n-cycles has been considered in [13? ]. The con- 
struction of the contextual inequalities considered in this 
paper can be easily extended to other measures of dis- 
order that obey the intuitive classical properties used in 
the proof. The entropic Bell inequality derived in [T] was 



generalized in 14J using the notion of mutual informa- 
tion, therefore an open question is to investigate which 
information-theoretic quantities optimally reveal the lack 
of objective realism in quantum systems. A further ques- 
tion is to fully analyze the relation between the entropic 
contextual inequality derived here and the pentagram in- 
equality. It would also be interesting to find optimal 
commutation graphs that reveal contextuality for given 
system dimensions |15) . This work is supported by the 
National Research Foundation and Ministry of Education 
in Singapore. 
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